Predicting the future from first person (egocentric) vision: A survey
نویسندگان
چکیده
Egocentric videos can bring a lot of information about how humans perceive the world and interact with environment, which be beneficial for analysis human behaviour. The research in egocentric video is developing rapidly thanks to increasing availability wearable devices opportunities offered by new large-scale datasets. As computer vision techniques continue develop at an pace, tasks related prediction future are starting evolve from need understanding present. Predicting activities, trajectories interactions objects crucial applications such as human–robot interaction, assistive technologies both industrial daily living scenarios, entertainment virtual or augmented reality. This survey summarizes evolution studies context making overview applications, devices, existing problems, commonly used datasets, models input modalities. Our highlights that methods have significant impact range further efforts should devoted standardization proposal datasets considering real-world scenarios ones vocation.
منابع مشابه
Egocentric Basketball Motion Planning from a Single First-Person Image
We present a model that uses a single first-person image to generate an egocentric basketball motion sequence in the form of a 12D camera configuration trajectory, which encodes a player’s 3D location and 3D head orientation throughout the sequence. To do this, we first introduce a future convolutional neural network (CNN) that predicts an initial sequence of 12D camera configurations, aiming t...
متن کاملAn Overview of First Person Vision and Egocentric Video Analysis for Personal Mobile Wearable Devices
The emergence of new wearable technologies such as action cameras and smart glasses has increased the interest of the computer vision scientists in the First Person perspective. Nowadays, this field is attracting attention and investments of companies aiming to develop commercial devices with First Person Vision recording capabilities. Due to this interest, it is expected to have an increasing ...
متن کاملFuture Person Localization in First-Person Videos
We present a new task that predicts future locations of people observed in first-person videos. Consider a firstperson video stream continuously recorded by a wearable camera. Given a short clip of a person that is extracted from the complete stream, we aim to predict that person’s location in future frames. To facilitate this future person localization ability, we make the following three key ...
متن کاملVisual Motif Discovery via First-Person Vision
Visual motifs are images of visual experiences that are significant and shared across many people, such as an image of an informative sign viewed by many people and that of a familiar social situation such as when interacting with a clerk at a store. The goal of this study is to discover visual motifs from a collection of first-person videos recorded by a wearable camera. To achieve this goal, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Vision and Image Understanding
سال: 2021
ISSN: ['1090-235X', '1077-3142']
DOI: https://doi.org/10.1016/j.cviu.2021.103252